Ranking-Based Black-Box Complexity* 

Benjamin Doerr Carola Winzen^ 

Max Planck Institute for Informatics, 66123 Saarbriicken, Germany 
{doerr I winzenjOmpi-inf . mpg . de 



Abstract 

Randomized search heuristics such as evolutionary algorithms, simulated annealing, and 
ant colony optimization are a broadly used class of general-purpose algorithms. Analyzing 
them via classical methods of theoretical computer science is a growing field. While several 
strong runtime analysis results have appeared in the last 20 years, a powerful complexity 
theory for such algorithms is yet to be developed. We enrich the existing notions of black- 
box complexity by the additional restriction that not the actual objective values, but only 
the relative quality of the previously evaluated solutions may be taken into account by the 
black-box algorithm. Many randomized search heuristics belong to this class of algorithms. 

We show that the new ranking-based model can give more realistic complexity estimates. 
The class of all binary- value functions has a black-box complexity of 0(log n) in the previous 
black-box models, but has a ranking-based complexity of Q{n). 

On the other hand, for the class of all OneMax functions, we present a ranking-based 
black-box algorithm that has a runtime of 0(n/logn), which shows that the OneMax 
problem does not become harder with the additional ranking-basedness restriction. 

Keywords: Query complexity; theory of randomized search heuristics; Mastermind; black- 
box complexity. 

1 Introduction 

Randomized search heuristics are general purpose algorithms for optimization problems. They 
include bio-inspired approaches such as evolutionary algorithms and ant colony optimization, 
but also classical approaches like random search or randomized hill-climbers. 

In practice, randomized search heuristics often are highly successful and thus extensively 
used [19]. They have the additional advantage that not too much understanding of the opti- 
mization problem at hand is needed, and that once implemented, they can easily be re-used for 
similar problems. 

One of the difficulties in using such heuristics is that it is very hard to predict which problems 
are easy for a suitable heuristic and which are generally intractable for randomized search 
heuristics. In contrast to a large body of empirical work on this problem, there has been much 
less theoretical work. This work mostly lead to results for particular problems and particular 
heuristics. Droste, Jansen, and Wegener [9] determined the runtime of the (1 -|- 1) evolutionary 
algorithm (EA) for several important test function classes. Another example is the work by 
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Neumann and Wegener [21], which shows that the (1 + 1) EA finds a minimum spanning tree 
using 0{m^ log m) function evaluations in connected graphs having m edges and polynomially 
bounded edge weights. 

Still, for a broader understanding of what are easy and difficult problems, a complexity the- 
ory similar to what exists in classical algorithmics would be highly desirable also for randomized 
search heuristics. The seminal paper by Droste, Jansen, and Wegener [lOj . introducing the so- 
called unrestricted black-box model, appears to be the first attempt to start such a complexity 
theory in the randomized search heuristics community. 

The paradigm that randomized search heuristics should ideally be problem-independent 
implies that the only way a randomized search heuristic can obtain problem-specific information 
is by evaluating solution candidates. This evaluation is done by an oracle that returns objective 
values, but reveals no further information about the objective function. An algorithm that has 
no access to the objective function (and thus has no access to the optimization problem to 
be solved) other than by querying objective values from such an oracle, is called a black-box 
algorithm. 

Given a class of functions J^, Droste et al. define the unrestricted black-box complexity 
of to be the minimum (taken over all black-box algorithms) expected number of function 
evaluations needed to optimize any function f £ J^. This number, naturally, is a lower bound 
on the runtime of any randomized search heuristic for the class 

In classical theoretical computer science, unrestricted black-box complexity is also studied 
under the notion of randomized query complexity. Many results for a variety of problems exist. 
Out of the many examples let us mention the problem of finding a local minimum of an unknown 
pseudo-Boolean function / : {0, 1}" — )• M. This problem has been studied intensively in the 
computer science literature, for deterministic algorithms (deterministic black-box complexity; 
cf., e.g., the work by Llewellyn, Tovey, and Trick jl8j ) and for randomized algorithms, for 
example by Aldous [2] , Aaronson [1] , and by Zhang |25] . Zhang gives a tight G(2'"/2^) bound 
for the randomized query complexity of finding a local minimum. 

Originally motivated by the coin-weighing problem, a much earlier work studying the unre- 
stricted black-box complexity of the generalized OneMax function class (definition follows) is 
the one by Erdos and Renyi jl2j . cf. Section HI This OneMax function class is also strongly 
related to the well-known Mastermind game, a game that has gained much attention from the 
computer science community. For example, Chvatal [1] studies a general version of this game 
with k colors and n positions. That is, the secret code is a length-n string z G {0, 1, . . . ,k — 1}"". 
Chvatal shows that for any constant number k of colors the codebreaker has a strategy reveal- 
ing the secret code using only 0(n/logn) guesses. In our notation this result is equivalent to 
saying that the unrestricted black-box complexity of the generalized OneMax function class is 
0(n/logn). The connection between unrestricted black-box complexity and randomized query 
complexity was seemingly overlooked so far in the randomized search heuristics community. 
Similarly, it seems that the community was not aware of the existing results for the Mastermind 
game. 

Unfortunately, it turned out that regarding all black-box algorithms leads to sometimes un- 
expectedly small complexities (obtained by not very sensible algorithms). As a trivial example, 
note that the unrestricted black-box complexity of any class of functions J- = {/} consisting 
of a single objective function is one. This is certified by the black-box algorithm that simply 
queries the optimum of /. 

This and further examples suggest that a restriction of the class of algorithms regarded 
might lead to more meaningful results. A major step in this direction is the work by Lehre and 
Witt [T7]. They introduce a so-called unbiased black-box model, which, among other restrictions 
to the class of algorithms regarded, requires that all search points queried by the algorithm must 
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be obtained from previous or random search points by so-called unbiased variation operators 
(see Section [2] for the full details). When only unary operators are allowed, this leads to a 
lower bound of J7(nlogn) for the complexity of any single-element class = {/} with / having 
a unique global optimum. This is, indeed, the typical runtime of simple search heuristics 
like randomized hill-climbers on simple function classes like strictly monotone functions; i.e., 
functions / : {0, 1}" — ?• M with /(x) < f{y) for all x,y € {0, 1}" for which (a) Xi = implies 
Ui = 0, 1 < i < n and (b) there is at least one index i such that Xj = 1 = 1 — y^. 

In this work, we shall argue that the unbiased model of Lehre and Witt is still not restrictive 
enough, and we will propose an alternative model. Let / : {0, 1}" — )• R, x i— )• 2^~^Xi be the 
binary-value function of the bit string x. Let BinaryValue* be the the class of functions con- 
sisting of / and all functions obtained from / by permuting the order of the bit positions and by 
flipping the meaning of the values of some bit-positions. In other words, BinaryValue* is the 
smallest class of functions containing / that is invariant under first applying an automorphism 
of the discrete n-dimensional hypercube {0,1}"". Then, as we shall show in this paper, the 
unbiased black-box complexity of BinaryValue* is at most [log2 n] + 2, if we allow the varia- 
tion operators to be of arbitrary arity. The corresponding black-box algorithm (see Section [5]) 
heavily exploits knowing the precise objective values of queried search points. 

This is what most randomized search heuristics do not do. They typically only use the 
objective values to compare search points. We define a black-box complexity notion referring to 
this paradigm by allowing the algorithms to only exploit the relative order of the search points 
queried so far. In other words, throughout the optimization process the black-box algorithm 
knows for any two search points x and y queried so far only whether f{x) < f{y), f(x) = f{y), 
or f{x) > f{y). In particular, it does not know the true values of f{x) and /(y). This 
model captures many commonly used randomized search heuristics, e.g., many evolutionary 
algorithms, hill climbers like Random Local Search, and ant colony optimization. 

We show that our ranking-based black-box model overcomes some drawbacks of the previous 
models. For example, for the binary-value function class BinaryValue* introduced above, both 
the otherwise unrestricted ranking-based black-box complexity and the unbiased ranking-based 
black-box complexity are of order 0(n) instead of O(logn) without the ranking restriction. In 
the 0(n) statement, the lower bound is clearly the more interesting one. This bound holds 
already for the subclass BinaryValue^ of BinaryValue* consisting of all functions 

n 

f, : {0, ir^R,x^Yl ® ' 

1=1 

z G {0,1}". 

The upper bound is easily verified by a simple hill-climber that, in arbitrary order, changes 
a single bit-value of the current solution and accepts the new solution if it is better than the 
previous one. In summary, we see that for this function class, the ranking-based black-box 
complexity seems to give us a more natural complexity measure than the previous approaches. 

We also analyze the ranking-based black-box complexity of a second function class that is 
often regarded in theoretical analyses of randomized search heuristics, namely the OneMax 
function class. Let OneMax„ be the class of all so-called OneMax functions 

f, : {0, 1}" ^ M, X ^ |{i G [0, n]nZ\xi = z,}\, 

z G {0, 1}". Hence, fzix) is the number of bit positions in which x and z coincide. Here, both 
the unrestricted and the unbiased black-box complexity are 0(n/ log n), which is slightly smaller 
than the Q{n log n) needed by most randomized search heuristics. The proofs of the 0(n/ log n) 
black-box complexity results again heavily exploit that the oracle returns the precise objective 
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values ("fitness values"). They all build on the beautiful idea that 0(n/logn) random search 
points together with their (precise) fitness determine the hidden objective function, cf. [3l[5tll2j. 
In spite of this, we present a ranking-based black-box algorithm that still solves the problem 
with 0(n/logn) queries. 

Our result on the OneMax class can also be interpreted in the context of the Mastermind 
game. As many authors (cf. [ Hll5j ) we regard the black-peg version of the game, where — instead 
of answering with black and white answer-pegs indicating in how many positions the secret code 
of the codemaker and the guess of the codebreaker coincide (black answer-pegs) and how many 
additional colors are correct but in the wrong position (white answer-pegs) — the codemaker 
does only reply with black answer-pegs. The ranking-based black-box complexity of OneA/Iax^ 
corresponds to the black-peg version of the Mastermind game in which the codemaker does 
respond to the codebreaker's guesses by providing a ranking of the guesses queried so far. 
This ranking is based on the number of black answer-pegs only. Then, as in the original 
generalized Mastermind game with black and white answer-pegs, the codebreaker still has an 
optimal winning strategy using only 0(n/logn) guesses. 

These two results show that in some cases, the additional restriction that only relative 
qualities of solutions may be taken into account does give more insightful problem difficulty 
estimates, whereas in other cases the ranking restriction does not change the unexpectedly low 
difficulty estimates given by the previous black-box models. 

We should note that there are two related research works in the literature. In [Hllinj) for 
certain classes T of functions the unrestricted black-box complexity of{/io / | / gJ^, /i:M— )• 
M strictly monotonically increasing} is regarded. This is closely related (see Section [3j) to the 
ranking-based black-box complexity, as introduced here, of the class T . The connection to 
black-box algorithms not exploiting the absolute function values, however, is not made there. 

Teytaud and co-authors |13tl23j give general lower bounds for the convergence rate of 
comparison-based and ranking-based evolutionary strategies in continuous domains. From these 
works results for discrete domains can be obtained. We do not see, however, that for such do- 
mains their lower bounds are stronger than the natural information-theoretic ones which were 
already observed in [SlfTO]. 

2 Preliminaries and Previous Black-Box Models 

In this section, we give a brief overview of two previous black-box models, the unrestricted 
black-box model by Droste, Jansen, and Wegener |10j and the more recent unbiased black-box 
model by Lehre and Witt [T7] . 

Let us first fix the notations used frequently throughout the paper. 

2.1 Notation 

The positive integers are denoted by N. For k £ N, we abbreviate [k] := {1, . . . , k}. Similarly, 
we define [0..A;] := [k] U {0}. For A;, £ G N we write [k ± £] := [k - £,k + £]n TL. 

Let n G N. For a bit string x = x\ . . .Xn G {0, 1}", we denote by x the bitwise complement 
of X, i.e., for all j G [n] we have Xj = 1 — Xy 

For n G N and j G [n] by e" we denote the j-th unit vector of length n. 

If X, y G {0, 1}'", we obtain the bit string x © y by setting, for each j G [n], (x © y)i := 1 if 
Xj / i/j, and (x © y)i := if Xj = yi. That is, © denotes the bitwise exclusive-or. We use the 
shorthand |x|;^ for the number of ones in the bit string x, i.e., \x\-y = X^ILi ^i- 

If / is a function and S a set, we write f{S) := {/(s) | s G 5}. We write id^ for the identity 
function of 5, i.e., ids'(s) = s for all s G 5. For n G N, the set Sn contains all permutations 
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of [n]. For a £ Sn and x G {0, 1}" we abbreviate a{x) := 2;o-(i) • • • ^o-(n)- 

For two real values a, 6 G M with a > b the interval [a, b] is defined to be the empty set. 
Lastly, we denote by In the natural logarithm to base e := exp(l). If we refer to a logarithm 

of a different base, we indicate this in the subscript; e.g., we write log2 for the binary logarithm. 
All asymptotic notation (Landau symbols, big-Oh notation) will be with respect to n, which 

typically denotes the dimension of the search space {0, 1}". 

2.2 Useful Tools 

Throughout the paper we shall apply several versions of Chernoff 's bound. The following can 
be found, e.g., in [TT] . 

Lemma 1 (Chernoff bounds). Let X = Y17=i-^i -^^"^ of n independently distributed 

random variables Xt, where each variable Xt takes values in [0, 1]. Then the following statements 
hold. 

Vt > : Fr[X > E[X] + 1] < exp{-2t'^/n) and Fr[X < E[X] - t] < exp(-2^V?^) , (1) 
Ve > : Pr [X < (1 - e) E[X]] < ex.p{-e'^ E[X]/2) . (2) 

We shall also use the following estimate on factorials. It is a direct consequence of Stirling's 
formula. The version presented below is due to Robbins |22j . 

Lemma 2 (factorials). For all n G N, 

^„n+l/2g-ngl/(12n+l) < ^, < V2^,^"+l/2e-ngl/(12n) _ 

2.3 Unrestricted and Unbiased Black-Box Complexity 

Usually, the complexity of a problem is measured by the performance of the best algorithm out 
of some class of algorithms, e.g., all algorithms which can be implemented on a Turing machine 

MM- 

What distinguishes randomized search heuristics from classical algorithms is that they are 
problem-independent. As such, the only way they obtain information about the problem to 
be solved is by learning the objective value of possible solutions ("search points"). To ensure 
problem-independence, one usually assumes that the objective function is given by an oracle 
or as a black-box. Using this oracle, the algorithm may query the objective value of any search 
point. Such a query returns the objective value of the search point, but no other information 
about the objective function. 

For simplicity, we shall restrict ourselves to real-valued objective functions defined on the 
set {0, 1}" of bit strings of length n. This is motivated by the fact that many evolutionary 
algorithms use such a representation. 

Naturally, we do allow that the algorithms use random decisions. From the black-box 
concept, it follows that the only type of action the algorithm may perform is, based on the 
objective values learned so far, to decide on a probability distribution over {0, 1}", to sample 
a search point x G {0, 1}" according to this distribution, and to query its objective value from 
the oracle. This leads to the scheme of Algorithm [H which we call an unrestricted black-box 
algorithm. 

In typical applications of randomized search heuristics, evaluating the fitness of a search 
point is more costly than the generation of a new search point. For this reason we take as per- 
formance measure of a black-box algorithm the number of queries to the oracle until an optimal 
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Algorithm 1: Scheme of an unrestricted black-box algorithm 

1 Initialization: Sample x^^^ according to some probability distribution p^^^ over {0, 1}" 
and query f{x^^^); 

2 Optimization: for t = 1, 2, 3, . . . do 

Depending on {{x^^\ f{x^^^)), . . . , fix^^~^^))) choose a probability distribution 

over {0, 1}" and sample x^*^ according to p^*-*; 
Query /(x^); 



search point is queried for the first time. Since we are interested in randomized algorithms, we 
regard the expected number of queries. 

Formally, for an unrestricted algorithm A and a function / : {0, 1}" — t- M, let T{A, f) G 
M U {oo} be the expected number of fitness evaluations until A queries for the first time some 
X G argmax/. We call T(A,f) the runtime of A for f or, likewise, the optimization time of A 
for f. We can now follow the usual approach in complexity theory. For a class J- of functions 
{0,1}" — M, the A-black-box complexity of F is T{A^F) := sup^gjr T(74, /), the worst-case 
runtime of A on T. Let ^ be a class of black-box algorithms for functions J-. Then the A-black- 
box complexity of T is T{A^J-) := inf/ig_4 r(A, J^), the minimum ("best") complexity among 
all A G ^ for T. If A is the class of all unrestricted black-box algorithms, we call T[A,J-) 
the unrestricted black-box complexity of T. This is the black-box complexity as introduced by 
Droste, Jansen, and Wegener [TO] . 

As mentioned in the introduction, it is easily seen that the class of all unrestricted black-box 
algorithms is very powerful. For example, for any function class J- = {/} consisting of one single 
function, the unrestricted black-box complexity of is 1. The algorithm that simply queries 
an optimal solution of / as first action shows this bound. 

This and related drawbacks of the unrestricted black-box model inspired Lehre and Witt [T7] 
to introduce a more restrictive black-box model, where algorithms may generate new solution 
candidates only from random or previously generated search points and only by using unbiased 
operators. Still this model contains most of the commonly studied search heuristics, such as 
many {^-\-\) and (/i. A) evolutionary algorithms, simulated annealing, the Metropolis algorithm, 
and Random Local Search. 

Definition 3 (/c-ary unbiased variation operator). Let A; G N. A /c-ary unbiased distribution 
{D{. I y^^\ . . . ) y^'^^))y(i),...,j;(fc)g{o,i}" a family of probability distributions over{f), 1}" such that 
for all inputs y^^\ ■ ■ ■ ,y^^^ G {0, 1}" the following two conditions hold. 

{i) Vx, z G {0, 1}" : D{x \ y^^^ , . . . , y^''^) = D{x ® z \ y^^) © z, . . . , y^''^ © z), 
[ii)yx G {0, 1}" Va G Sn : D{x \ y^'\ . . . ,yW) = D{a{x) \ a{y^'^), . . . ,a(y('=))) . 

We refer to the first condition as ©-invariance and to the second as permutation invariance. 
A variation operator creating an offspring by sampling from a k-ary unbiased distribution is 
called a /c-ary unbiased variation operator. 

Note that the only 0-ary unbiased distribution over {0, 1}" is the uniform one. 1-ary opera- 
tors, also called unary operators, are sometimes referred to as mutation operators, in particular 
in the field of evolutionary computation. 2-ary operators, also called binary operators, are of- 
ten referred to as crossover operators. If we allow arbitrary arities, we call the corresponding 
black-box model the *-ary unbiased black-box model. 
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k-ary unbiased black-box algorithms can now be described via the scheme of Algorithm [5J 
The k-ary unbiased black-box complexity of some class of functions is the complexity of J-" 
with respect to all k-ary unbiased black-box algorithms. 



Algorithm 2: Scheme of a /c-ary unbiased black-box algorithm 

1 Initialization: Sample x^^'^ G {0, !}"■ uniformly at random and query f{x^'^^); 

2 Optimization: for t = 1, 2, 3, . . . do 

Depending on (^f{x^^^), . . . , /(x^*""*^')) choose k indices ii, . . . ,ik G [0..t — 1] and a 
k-ary unbiased distribution {D{. \ y(^), . . . , y*'''"''))y(i)_,,,_y(fe)g{o,i}"; 
Sample x^*^ according to D{. \ x^'^'^\ . . . ,x^^'^'^) and query /(x^*^); 



As we mentioned in the introduction, Lehre and Witt [T7j proved, among other results, 
that all functions with a single global optimum have a unary unbiased black-box complexity of 
il(nlog?i). For several standard test problems this bound is met by classical unary randomized 
search heuristics such as the (1-1-1) evolutionary algorithm or Random Local Search. Recall 
that, as pointed out above, the unrestricted black-box complexity of any such function is 1. 



3 The Ranking-Based Black-Box Model 

Since many standard randomized search heuristics do not take advantage of knowing the exact 
objective values but rather take into account only the relative quality of search points, Nikolaus 
Hansen (INRIA Saclay, France; personal communication) suggested to develop a corresponding 
black-box model. In fact, many heuristics create new search points based only on how the 
objective values of the previously queried search points compare. That is, after having queried 
t fitness values f{x^^^), . . . , f{x^^~^^), they rank the corresponding search points x^^\ . . . , x^^~^^ 
according to their relative fitness. The selection of input individuals x^*^^ . . . , x^*'') for the next 
variation operator is based solely on this ranking. 
We define the ranking induced by / as follows. 

Definition 4 (ranking induced by /). Let S be a set, let f : S be a function, and let C be a 
finite subset of S. The ranking pc of C induced by f assigns to each element c£ C the number 
of elements in C with a smaller f -value plus 1, formally, pc{c) := 1 -|- \{c! S C \ f{c') < /(c)}|. 

Note that two elements with the same /-value are assigned the same rank. 

As discussed above, when selecting a parent population for generating new search points, 
many randomized search heuristics do only use the ranking of the search points seen so far. In 
this work we are interested in how this fact influences the complexity of standard test function 
classes. Therefore, we regard here the restricted class of black-box algorithms that use no other 
information than this ranking. This yields the following black-box models. 

The unrestricted ranking-based black-box complexity of some class of functions is the com- 
plexity with respect to all algorithms following the scheme of Algorithm [3 

Similarly, the k-ary unbiased ranking-based black-box complexity of some class of functions 
is the complexity with respect to all algorithms following the scheme of Algorithm HI 

Both ranking-based black-box models capture many common search heuristics such as evo- 
lutionary algorithms using elitist selection, ant colony optimization, and Random Local Search. 
They do not include algorithms like simulated annealing, threshold accepting, evolutionary 
algorithms using fitness-proportional selection, or the Metropolis algorithm. 
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Algorithm 3: Scheme of an unrestricted ranking-based black-box algorithm 

1 Initialization: Sample x^^^ according to some probability distribution p^^^ over {0, l}*^ 
and query f{x^^^); 

2 Optimization: for t = 1, 2, 3, . . . do 

Depending on the ranking of {x^^\ . . . ,x^^~^^} induced by /, choose a probability 
distribution p^^^ over {0, 1}" and sample x^^^ according to p^^^; 
Query the ranking of {x^^\ . . . induced by /; 



Algorithm 4: Scheme of a k-aiy unbiased ranking-based black-box algorithm 

1 Initialization: Sample x^^^ G {0,1}" uniformly at random and query /(x*-^-*); 

2 Optimization: for t = 1, 2, 3, . . . do 
Depending on the ranking of {x^^^ . . . induced by /, choose k indices 
ii, . . . G [0..t — 1] and a fc-ary unbiased distribution 

{D{. I y(^\...,y(*^)))y{i),...,j,we{o,i}n; 

Sample x*-*^ according to D(. \ x^*^\ . . . jX^**^^) and query the ranking of 
{x^'^^ . . . jX^*^} induced by /; 



To distinguish the unrestricted and the unbiased black-box model from their ranking-based 
counterparts, we shall sometimes refer to them as the basic unrestricted black-box model and 
the basic unbiased black-box model, respectively. 

When working with the ranking-based models, the fact that the rank of a search point 
X varies with the number of already queried search points may be distracting. However, the 
ranking-based models can be equivalently modeled via an unknown adaptive monotone pertur- 
bation of the fitness values. By this we mean that instead of ranking all previously queried 
search points, the oracle may as well reply to any query x^*^ with some value g{f{x^^^)), where 
/ is the secret function to be optimized and (7 : M — )• M is a strictly monotone function that 
depends on all search points x^^^ . . . , x^*) queried so far. 

To make this model more precise, let us first recall that a function is said to be strictly 
monotone if for all a < /3 we have g{a) < g{(3)- We show how the oracle can construct such 
a strictly monotone function "on the fly", preserving the ranking of the search points. Let 
A be a black-box algorithm. When algorithm A queries a search point x^''^ for initialization, 
the oracle responds to A with "(51 o /)(x(''^) = 0". That is, it sets gifix^'^^)) := 0. For any 
iteration t, if algorithm A queries x^*^ the oracle returns to A the value 

5(/(xW)) = 

'c/(/(xW)), if /o(xW) = p(x(^)) for some i € [0..t - 1] , 

max{5(/(xW)) | i G [0..t - 1]} + 2", if /3(xW) = 1 , 
< min{5(/(xW)) | i G [0..t - 1]} - 2", if p(xW) = t + l, 
(^(/(xC^))) - |5(/(x»))|) /2, if p(xW) = max{/)(xW) | p(xW) < p(xW)} 

and /9(xW) = min{/>(x(^)) | p{x^^^) > p{x^^^)} , 

where we abbreviate p := P{x(i)\i£[Q..t]}y ranking of {x^*-* | i G [0..i]} induced by /. It is easily 
verified that indeed we have 5(/(x»)) > ff(/(x(j))) if and only if /(x«) > /(x^^)). I.e., g can 
be extended to a strictly monotone function M — t- M. 
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As any stage of the run of the black-box algorithm, the information revealed by the {g o /)- 
values and the information revealed by the ranking of the search points is the same. Therefore, 
the two models are equivalent. 

We sometimes refer to the model with a (g o /)-oracle as the (unrestricted or unbiased, 
respectively) monotone black-box model. In particular for proving upper bounds this model is 
more convenient to work with. 

Convention. In what follows, we shall always denote by g the monotone perturbation. 
That is, g is the function, which is used for representing the ranking of the already queried 
search points. 

In [HllTO], Droste, Jansen, Tinnefeld (only [8]) and Wegener implicitly regard a different 
notion of black-box complexity without exploiting absolute fitness values. For a given class 
of functions / : {0, 1}" — ?• M, they regard the unrestricted black-box complexity of the monotone 
closure := {h o f \ f ^ J^,h : M. ^ M strictly monotonically increasing} of J^. Clearly, the 
optimal search points of / and h o f are the same. Moreover, the relative quality of two 
search points is the same under the fitness function / and ho f. Hence a black-box algorithm 
optimizing h o f , since h is unknown, is in a similar position as a ranking-based black-box 
algorithm optimizing /. 

Unfortunately, contrary to the first believe, it is not so obvious whether the unrestricted 
black-box complexity of J- and the ranking-based black-box complexity of J- are the same. This 
has been informally argued in [8l|10], but we currently do not see a rigorous proof for such a 
statement. The difficulty is that, theoretically, a black-box algorithm optimizing an unknown, 
but from that point on fixed, h o f might acquire a probabilistic knowledge on h and exploit 
this in future queries. This might put the algorithm in a better position as in the adaptively 
monotonically perturbed model described above. 

This problem does not exist if we only regard deterministic black-box algorithms. Here we 
may adaptively change the function h during the optimization progress and argue that in fact 
we could have started with this h. This argument is not possible for randomized algorithms 
where the distribution of the answers obtained so far has to be taken into account. Likewise, 
we cannot invoke Yao's minimax principle since J- is not finite. 

For these reasons, we currently do not know whether the black-box complexity of J- and 
our ranking-based black-box complexity of J- are the same or not. Clearly, the ranking-based 
black-box complexity is always not less than the black-box complexity of J-. 

Given that we are not sure that both models agree, [UlTO] show that the unrestricted black- 
box complexity of the monotone closure of BinaryValue„ is at least of order n/logn and at 
most of order n + 2. 

4 The Ranking-Based Black-Box Complexity of OneMax 

A classical easy test function in the theory of randomized search heuristics is the function 
OneMax, which simply counts the number of 1-bits, ONEMAx(a::) = ^^^^iXi. The natural 
generalization of this particular function to a non-trivial class of functions is defined as follows. 

Definition 5 (OneMax function class). For z G {0, l}*^ let 

OM2 : {0, l}'' [0..n],x ^ OMz{x) = \{i G [n] \ Xi = Zi}\ . 

The string z = argmaxOM^ is called the target string of Om^. Let OneMax^ := 
{Om^ I z G {0, 1}"} be the set of all generalized OneMax functions. 

Our main result is the following. 
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Theorem 6. The unary unbiased ranking-based black-box complexity of OneMax^ is 
0(nlogn). For 2 < k < n, the k-ary unbiased ranking-based black-box complexity o/OneMax„ 
is 0(n/ log k). 

For k = n^(^) this statement is asymptotically optimal since already for the unrestricted 
black-box complexity a lower bound of 0(n/logn) has been shown by Erdos and Renyi [T^ 
and independently by Chvatal 0] and again later also by Droste, Jansen, and Wegener [TO] , 
Also independently of each other, Erdos and Renyi [12], Chvatal [1], and Anil and Wiegand [3] 
proved a matching upper bound. This shows that the unrestricted black-box complexity of 
OneMax„ is 0(n/logn). 

The unary unbiased black-box complexity of OneMax^ is 0(nlogre) [T7]. Higher arity 
models were studied in [5]. The authors prove that for 2 < k <n the A;-ary unbiased black-box 
complexity of OneMax„ is 0(n/logA;). Theorem [6] shows that we can achieve the same bound 
in the (much weaker) unbiased ranking-based model. 

To ease reading, we split the proof of Theorem [6] into three parts. The first part, Section [57T1 
is the easiest. It deals with constant values of k. We show that any class of generalized strictly 
monotone functions can be optimized by a ranking-based algorithm in 0(n) queries using only 
binary variation operators. OneMax„ is such a class of generalized strictly monotone functions. 
For the unary setting, Theorem [6] follows from the facts that (i) already the basic unbiased unary 
black-box complexity of OneMax^ is 17(77, log n) [T7] and that (ii) that Random Local Search 
is a unary unbiased ranking-based algorithm which optimizes OneMax„ in 0(77log?i) queries. 

The second case, covered in Section 14.21 is the most interesting one. We show that 
for k = n there exists an unbiased ranking-based algorithm which optimizes every function 
Om^ G OneMax „ using only 0(77/ log 77) queries. More precisely, we show that after 
0(77/ log 77) samples chosen from {0,1}" independently and uniformly at random, with high 
probability, it is possible to identify the target string z. This random sampling idea is also 
the basis for the previous results on OneMax^ [3H5lll2j. However, we need to be more careful 
here as we require all variation operators to be unbiased, and furthermore, other than in all the 
previously mentioned works, we do not have exact knowledge of the fitness values. 

Lastly, we show how to deal with the case of arbitrary k € oj{l)^k < n. We prove that, 
for any such k, we can independently optimize substrings ("blocks") of size k in 0(A;/logA;) 
iterations, using only fc-ary unbiased variation operators. We optimize these blocks sequentially. 
Since there are Q{n/k) such blocks of length k, the desired 0(r7/logfc) bound follows. These 
are Sections 14.31 and 1 4.41 

4.1 Proof of Theorem [6] for Constant Values k 

As mentioned above, for k = 1 the lower bound in Theorem [6] follows from |17l Theorem 6], 
which states that the unary unbiased black-box complexity of any class of functions {0, 1}" — )• M 
having a single global optimum is i}{nlogn). Clearly, OneMax„ is such a class. Since the k- 
ary unbiased black-box complexity of any class of functions is a lower bound for the A;-ary 
unbiased ranking-based black-box complexity of J-, this also shows that the unary unbiased 
ranking-based black-box complexity of OneMax„ is Q{nlogn). 

The upper bound is certified by a simple hill-climber. Random Local Search (cf. Algo- 
rithm [5|). It is easily verified that the variation operator implicit in the mutation step is a 
unary unbiased one. The selection depends only on the ranking of the current search point x 
and its neighbor y. Hence, Random Local Search (often abbreviated RLS) is a unary unbiased 
ranking-based black-box algorithm. By the coupon collector's problem (cf. [2^ or any other 
textbook on randomized algorithms) the expected runtime of RLS on any OneMax„ function 
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Algorithm 5: Random Local Search for maximizing /: {0, 1}" — t- M. 



1 Initialization: Sample x G {0, 1}" uniformly at random and query /(x); 

2 Optimization: for t = 1, 2, 3, . . . do 

3 Choose j G [n] uniformly at random; 

4 Set y ^ X ® e" and query f{y); / /mutation step 

5 if f{y) > f{x) then x ^ y ] / /selection step 



is O(nlogn). This concludes our comments on the unary unbiased ranking-based black-box 
complexity of OneMax„. 

For k > 2 we prove the more general statement that all classes of generalized strictly 
monotone functions have a binary unbiased ranking-based black-box complexity that is at most 
linear in n (Lemma [7] below). 

Following the standard notation, we write x ~< y if for all i G [n] we have Xj < yi and if there 
exists at least one i £ [n] such that Xi < yi. A function / : {0, 1}" — )• M is said to be strictly 
monotone if for all x,y £ {0, 1}" the relation x < y implies /(x) < f{y). 

We extend this notation as follows. For all z G {0, 1}" we call / : {0, 1}" — ?• M strictly 
monotone with respect to z if the function : {0, 1}" — t- M, x i— )• f[x © z) is strictly monotone. 
As is easy to verify, we have argmax/ = z. 

Let be a class of real- valued functions defined on {0, 1}". We call T a class of generalized 
strictly monotone functions if for all / G -F there exists a z G {0, 1}" such that / is strictly 
monotone with respect to z. We note that for all z G {0, 1}" the function Om^ is strictly 
monotone with respect to z. 

In [5] the authors argue that the unbiased binary black-box complexity of any class of 
generalized strictly monotone functions is 0{n). Since a formal proof is missing in [5], for 
completeness, we give a full proof here. It follows closely the ideas presented in [5]. 

Lemma 7. Let k > 2 and let T he a class of generalized strictly monotone functions. The 
k-ary unbiased black-box complexity of J- is at most An — 5. 

For proving Lemma [7] we show that two bit strings x,y £ {0, 1}" suffice for encoding which 
bits have been optimized already. We start with two complementary bit strings y = x and 
throughout the run of the algorithm we ensure that Xi = yi holds only if the entry Xj in position 
i is known to equal the entry Zi of the target string. For each bit individually, we test whether 
it should be set to zero or to one. This yields the linear runtime. We show that all this can be 
done using binary operators only. 

Proof of Lemma Note that it suffices to prove the statement for k = 2 since for any i > k 
the i-aiy unbiased black-box complexity is bounded from above by the k-ary one. We claim 
that Algorithm [6] certifies Lemma [3 

First we note that the selection/update step (lines 7,8,11) depends only on the rankings of 
the search points. Therefore, Algorithm [6] is a ranking-based one. 

Apart from the uniform sampling variation operator. Algorithm [6] makes use of the following 
variation operators. 

• coniplement(-) is a unary variation operator which, given some x G {0,1}", returns 
coniplement(x) := x, the bitwise complement of x. 

• f lipOneWhereDif f erent(-, •) is a binary variation operator which, given some x,y £ 
{0,1}", first chooses uniformly at random a bit position j G {i G [n] \ Xj / yi}. With 
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Algorithm 6: A binary unbiased ranking-based black-box algorithm for optimizing gen- 
eralized strictly monotone functions f € J-. 

1 Initialization: Sample x € {0, 1}" uniformly at random and query g{f{x)); 
1 Set y ^ complement(x) and query g{f{y)); 
3 Optimization: for t = 1, 2, 3, . . . do 

4 
5 
6 
7 
8 



9 
10 
11 



Sample w ^ f lipOneWhereDif f erent(x, y) and query g{f{w)); 
if 5(/H) > g{f {x)) then 

Set w' ^ disti(x,w) and query g{f{w')); 

if g{f{w')) = g{f{x)) then Update x ^ w \ j jx and w differ in 1 bit 
else if g{f{w)) > g{f{y)) then Update y w; j jy and w differ in 1 bit 

else if g{f{w)) > g{f{y)) then 

Set w' ^ disti{y,w) and query g{f{w')); 

if g{f{w')) = g{f{y)) then Update y ^ w ] j jy and w differ in 1 bit 



probability 1/2, it returns x(Be^ and with probability 1/2 it returns y©e". That is, with 
equal probability f lipOneWhereDif f ereiit(x, y) either flips exactly one bit in x, in which 
X and y differ, or it flips one such bit in y. 

• disti(-,-) is a binary operator which, given some x,w £ {0,1}"" returns disti(x,?u) = x 
if the Hamming distance |x © of x and w equals 1 and it returns w otherwise. 

It is easily verified that compleinent(-) is an unbiased unary variation operator. This follows 
from the fact that x (B w = x (B w for all x,w G {0, 1}" and that a{x) = a{x) for all a G Sn- 
By a similar reasoning and obeying the fact that the position to be flipped is chosen uniformly, 
one can easily show that f lipOneWhereDif f erent(-, •) is unbiased as well. Lastly, we also 
have a{disti{x,w)) = disti((T(x), iT(^y)) and disti(x ® y,w ® y) = disti(x,w) © y for all 
x,y,w € {0, 1}" and all a G 5„. This shows that disti(-, •) is also unbiased. 

For proving that Algorithm [6] indeed certifies Lemma [71 let us fix a function f £ J- and let 
us assume that / is strictly monotone with respect to some fixed z S {0, 1}"'. 

We show that throughout the run of Algorithm [6] the following invariant holds: for all 
i S [n] we have Xj = yi only if Xj = Zj. After initialization we have Xj 7^ yi for all i £ [n]. 
Hence, by construction, the invariant is satisfied. Once we accept a bit fiip of position i £ [n], 
we necessarily have Xj = yi. Hence, the bit will not be flipped in any further iteration of 
the algorithm. Furthermore, for any two bit strings x,w £ {0, l}" with Hamming distance 
|x © w\i = 1 we have f{w) > /(x) (and, by definition of g, g{f{w)) > g{f{x))) if and only if 
Wi = Zi where i £ [n] is the one position in which x and w differ. This shows that the invariant 
is always satisfied. 

Next we show that Algorithm [6] terminates and that the expected runtime is at most 4n — 5. 
First we bound from above the number of queries needed to reduce the Hamming distance 
of X and y from n to two. To this end, let x,y £ {0,1}" with |x — y\i > 2. Then, for 
w = f lipOneWhereDif ferent(x, y) either we have |x © w\i = 1 or |y © w\i = 1. Both events 
are equally likely and exactly one of them yields an update of x or y, respectively. Therefore, 
the probability of an update equals 1/2. If we update any one of the two strings, the Hamming 
distance of x and y decreases by 1. This implies an expected number of 2(n — 2) iterations for 
decreasing the Hamming distance of x and y from n to 2. Any such iteration requires at most 
two queries, the one for g{f{w)) and the one for g{f{w')). Therefore, on average, we need at 
most 4(n — 2) queries to reduce the Hamming distance from x and y from n to 2. 
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Once the Hamming distance of x and y is reduced to two, either we have z S {x, y} or we 
have that both |x©2;|i = 1 and = 1. By the random initiahzation of x and the fact that 

we replace x and y only by bit strings of Hamming distance one, all bits Xi for which Xj 7^ yi 
satisfy Pv[xi = 1] = Pr[xj = 0] = 1/2 and, likewise, Pr[?/j = 1] = Pr[yj = 0] = 1/2. Therefore, 
the event z S {x, y} occurs with probability 1/2. In this case we are done, because both g{f{x)) 
as well as g{f{y)) have been queried already. Therefore, let us assume that z ^ {x,y}. Let 
again w = f lipOneWhereDif f erent(x, y). Then both |x © = 1 and |y © w|i = 1 must 
hold. Since only two such strings with Hamming distance one from both x and y exist, we have 
Fr[w = z] = 1/2. Furthermore, g{f{w)) > g{f{x)) or g{f{w)) > g{f{y)) holds only if w = z. 
That is, if w ^ z, then neither x nor y will be updated. Therefore, once \x (B y\i = 2, it takes, 
on average, 1/2-0+1/2-2 = 1 query until Algorithm [6] queries z = arg max / for the first time. 

Together with the two queries needed for initialization (lines 1 and 2 of Algorithm E]) , the 
runtime of our algorithm can be bounded from above by 2 + 4(n — 2) + l = 4n — 5. □ 

4.2 Proof of Theorem [6] for k = n 

We show that there exists a ranking-based black-box algorithm which optimizes any function 
Om^ e OneMax„ in 0(n/logn) iterations. 

To ease reading of the following description, let us already fix here some unknown function 
OM2 G OneMax„,. In order to optimize Om^, the algorithm has to query the target string 

z e {0,1}^^. 

We work with the monotone model, i.e., whenever the algorithm queries from the oracle a 
search point x, it receives from it the value g^OM^ix)^ . 

The rough description of the algorithm certifying Theorem [6] for A; = n is fairly easy. It 
first samples s € 0(n/ log n) search points xi,...,Xs from {0,1}"' mutually independent and 
uniformly at random. We show that, with high probability, knowing only the {g o OM^)-values 
{g(^OMz{xi)) I i G [s]} suffices to create the target string z using only two additional (unbiased) 
iterations. These last two queries, however, require some technical effort. This is the main part 
of this section. 

In what follows, let k > 2 be a constant, let /3 := e~^'^ {2^/^^)'^^, and let a be a constant 
that is at least 8 ^1 — 2e~'^^^^ . Furthermore, let s := an/Inn and let xi, . . . ,Xs be sampled 

from {0, 1}" independently and uniformly at random. 

We divide the proof of Theorem [6] for the case k = n into three steps. We feel that the 
intermediate steps are interesting on their own. Each of the following statements holds with 
exponentially small probability of failure. In particular, they hold with probability at least 
1 — o(n~^) for all constant values of A. 

• First we show that for each £ G it ^^^/n\ there is at least one bit string Xj such that 
OMz{xi) = I. Furthermore, the set of all samples with OM^-value in it Hy/n] has size 
at least §(1 - 26" 2''"). 

• In the second part we show how to identify g{^) and how this knowledge suffices to 
identify the interval g{[^ it Ky/n\). This allows us to calculate Om2(x) for all x with 
OMz{x) S ± K^/n]. That is, for such x we are able to undo the monotone perturbation 
caused by g. 

• Lastly, we show that there does not exist any y € {0, 1}"'\{2;} with OMy(x) = Om2(x) for 
all such samples x G {xi, . . . , Xg} with OMz{x) € [§ ± K^/n\. Thus, we can unambiguously 
determine z. 
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Part 1: Flooding the interval it Ki/n] 

In the next two lemmata we show that, with probability at least 1 — 3K^/nexp{—^^^), for all 
£ G lb Ky/n\ there exists at least one i G [s] such that OMz{xi) = i. 

Lemma 8. Let ^ G =b ^^^/n\ and let x be sampled from {0, l}" uniformly at random. For 

large enough n we have that Pr[OM^(x) = ^] > /3n~^/^. 

Proof. Clearly, Vi[Om^{x) = £] = (")2-". Thus, we have to prove that (") > /32"n-i/^ for 
large enough n. Let 7 G [— k, +k] such that I = + ^^/n. 

By definition we have (^) = j^^, = („/2+^^)?(„,/2-7v^)! • Lemma [2] we can bound 



n 



71+1 ^ 



2 ' W - V2 
n ^\ . , — / n ' 



2 ^V^) ! < V2^ - ^V^) 2 ,-(§-7V^),l/(12(§-7VH)) 



We rewrite 



(*) 



where term (*) equals ( (1 + 1 • This term converges to e . For all n, term (*) can 

Kp Kminrlprl frnm ahnvp Iiaa p^T 



be bounded from above by e 
Similarly we rewrite 



77-1-1 

n+1 p- rt+1 ^ / r. \ —'Yx/ri / o \ — h— 

n \— -7v^ -7v^/ 27 \ / 27 \ 2 



where term (**) equals I f 1 ~ 1 • This term also converges to e'^"' . For any n, 

expression (**) can be bounded from below by e^'*'^. However, by convergence, there exists a 
no G N such that for all n > no we have (l — '^^^ < 2e^'^^ . 
Finally, let us note that, 

71-1-1 71+1 n+1 

n / \ \ n \ n 



and, for large enough n, 

gl/(12(§+7v^))gl/(12(f-7v^)) ^ gl/(3n-1272) < 

Altogether we obtain for large enough n that 

n\ 2"+i 2^ T- 



□ 
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By applying a ChernofF bound (Lemma [TJ to the result of Lemma [51 we immediately obtain 
the following. 

Corollary 9. Pr [V£ G [§ ± 3i G [s] : OM,{xi) =£]>!- 3K^/Ee^p{-^^) . 

Proof. Throughout this proof assume that n is sufficiently large. 

Let ^ G =b Ky/n\. For all i G [s] let Xf be the indicator variable of the event OMz{xi) = i. 
Let := Ylt=i^i- By Lemma [8] we have E[X^] > sjin-^/'^ = apn^/'^ In'^ n. By applying a 
ChernofF bound (cf. Lemma [H ^) we derive 

Pr[X^ < ^] < Pj.[^^ ^ 1 ^x^]] < exp(-i E[X^]) < exp(-^^ni/2 _ 

The statement follows from a simple union bound argument. The probability of not 
sampling at least one of the values in it n^/n] can be bounded from above by i^K^fn + 
l)exp(-^f ni/2ln-in). □ 

We conclude the first part by the following elementary lemma, which again is not best 
possible, but good enough for our purposes. It shows that almost one half of the s samples 
xi, . . . , lie in the interval ± K\/v\. 

Lemma 10. (i) If x is drawn from {0,1}" uniformly at random, then Pr[0M2(x) G ± 
K^M] > l-2e"2'''. 

(ii) Let S be the number \{i G [s] \ OMz{xi) G =b K-^/n]}! of samples with OMz-value in 
[§ ± With probability at least 1 - exp(- ^°" ) we have 5 > f (1 - 2e"2«^'). 

Proof. Let x be drawn from {0, l}" uniformly at random. Then, by Chernoff 's bound (cf. ([1]) 
in Lemma [T|), 

Pr[OM^(x) G [f ± K^n]] = 1 - Pr[|OM^(x) - E[0m^(2;)]| > K^/^ > 1 - le'"^"^ . 

This shows (i). Furthermore, we expect at least (1 — 2e~^'*^)s samples to have an OM^-value 
in lb K.^yn]. By again applying a Chernoff bound (cf. ([T]) in Lemma [T]), we bound 

Fr[S < i(l - 2e^2K')s] < Pr[5 < i E[S]] < exp(-i E[S]) = exp(-i(l - 2e-^''^)s) . 

□ 

Part 2: Identification of g{^) and of g [[^ ± K^/n\) 

From the previous part we know that after drawing s = an /Inn samples independently and 
uniformly at random, we can assume that for each value £ G =b Ky/n\ there exists at least one 
i G [s] such that OM^(xi) = i. Furthermore, we have bounded the number of samples that fall 
into the interval it K^/n\. As we shall see in the third part of this section, if we could identify 
these samples with OM^-value in ± ^^^/n], then, with high probability, we could determine 
the target string z. In this part we show that on top of the s samples xi, . . . ,Xs we need only 
one additional query to determine g{^). Once we have identified the value <?(§), from Part 1 
we infer that we also learned g{i) for all £ G =b K^/rl\. 

We first explain how to identify g{^)- We do this by exploiting the strong monotonicity of g. 
To be more precise, we make use of the fact that g preserves the element defining the median 
of a set of objective values. That is, if element v is the median of a multi-set {vi, . . . , vt}, then 
g{v) is a median of the multi-set {g{vi), . . . ,g{vt)}. Here in our context we define the median 
of a finite multi-set S to be the smallest value v £ S such that the number of elements in S 
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which are smaher or equal to v is at least half the size of S. Formally, v = min{£ £ S \ \{s £ 
S \ s < £} > \S\/2}. For our context we set 

m := mm{e G [0..n] | \{i G [s] \ OM^(xi) < £}\ > n/2} . 

For all statements that follow, we assume that n is large enough. 

Lemma 11. The probability that m' G =b ^/n\ is at least 1 — 2exp(— 2a/3^n/lnn). 

Proof. We bound the probability that more than s/2 samples have an OM^-value that is less 
than ^ — ^yn and we bound the probability that more s/2 samples have an OM^-value that is 
larger than ^ + ^/n. 

Let X G {0, 1}" be sampled uniformly at random. By symmetry, for all 7 it holds that 

Pr[OM,(x) = § + 7] = Pr[OM,(x) = § - 7] ■ (3) 
Furthermore, by Lemma [8] we have 

1 r- 
-2+^/11 

Pr [Om^(x) ^ [§ ± ^/n\\ = 1 - Pr[OM^(2;) = i] < 1 - {2^ + l)fin-^l^ 

<1-2P. (4) 

Equations ^ and imply Pr[OM2(2;) < f — < ^ — /3. By linearity of expectation 
we can thus bound the expected number X of samples with OM^-value less than ^ — ^/n by 
sil-13). 

From the Chernoff bound ([TJ in Lemma [1] we derive 

Pr [X > §] < Pr [X > E[X] + s/3] < exp ( - 2s'^f3^/s) = exp(-2a/3^n/ Inn) 

By symmetry, the same reasoning proves Pr [i^ > |] < exp(— 2a/3^n/lnn) for the number 
Y of samples with OM^-value larger than ^ + i/n. The statement follows from a union bound 
over the two events X > ^ and Y > ^. □ 

With Lemma[TT]at hand, the identification of g{^) and g ([^ it K^/nfj is easy. 

Lemma 12. If we know the {go OMz)-values of xi, . . . , Xs, we can apply a unary unbiased 
variation operator to one of these samples in order to create one additional search point x' such 
that after querying g{OMz{x')) we can identify ^([f]) and g ([[§] ± K^/n\'), with probability at 

least 1 — c^/nexp{—^^^) for some constant value c. 

Proof. Let m be the median of the multi-set {g{OMz{xi)), . . . , g{OMz{xs))}. Since g is a strictly 
monotone function, we have m = g{m'). Lemma [11] yields mG5([[^]=b ^/n\), with probability 
at least 1 — 2exp(— 2a/3^n/lnn). 

According to Lemma[8l there exists a sample Xi G {xi, . . . ,Xs} such that g{OMz{xi)) = m, 
with probability at least 1 — 3K-y/reexp(— g^^ ). We show how sampling the bitwise complement 
1^ of Xi reveals 5(f). 

Before we prove this claim let us first note that xj can be obtained from Xi by the unary 
unbiased variation operator coniplenient(-) which we have introduced in the proof of Lemma[71 
For even values of n, our algorithm to identify g{^) is Algorithm [71 Here we denote by 

median(5f(OM^(y)),5-(OM^(xi))|s'(OM^(xi)),... ,5r(OM^(2;^))) 
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Algorithm 7: Identifying g{^) for even values of n. 

1 Sample i £ {j £ [s] \ g{OMz{xj)) = m} uniformly at random; 

2 Set y ^ complement(x) and query g{OMz{y)); 

3 if g{OMz{y)) = g{OM^{xi)) then nig ^ m; 

4 else nig ^ median(5f(OM^(2/)), 5f(OM^(xi))|c/(OM^(xi)), . . . , g{OM^{xs))); 

5 output lUg 



the median of the set (not the multi-set !) 

{5(Om,(xi)), . . .,g{OM,{xs))} n {[g (OM,(xi)) , g (OM,(y))] U [g (OM,(y)) , g {OM,{xi))]) . 

To show the correctness of Algorithm [TJ let us first assume that g{OMz{y)) = g{OMz{xi)) 
holds. Since 5 is a strictly monotone function, this implies OMz{y) = OMz{xi). But then 
m' = OMz{xi) = ^ must hold by the symmetry property that we mentioned already in equation 
^ in the proof of Lemma [TTl 

Therefore, we may assume without loss of generality that g {OMz{xi)) 7^ g {OMz{y)). As 
mentioned above, by Lemma II 11 with probability at least 1 — 2exp(— 2a/3^n/lnn) we have 
OMz{xi) = g~^{m) G [^±y^]. So is g{OMz{y)) by the symmetry of the Om^; function (equation 
([3])). The symmetry also implies | = {OMz{xi) + OM^(y))/2. 

Assume OMz{xi) < OMz{y)- Then ^ is exactly the median of the integer values 
in [OMz{xi),OMz{y)]- But since we have — by Corollary [9] — with probability at least 1 — 
3KV^exp(-^) 

[OMz{xi), OM^(y)] C {OMz{xi), Om^(x^)} , 

the median of the integer values in [OMz{xi), OMz{y)] equals the median of the integer values 
{Om^(2;i), . . . , Om^(2;^)} n [OMz{xi), OM^(y)]. 

Since g is a strictly monotone function we also have, with the same probability, 

g {[OUzixi), OUziy)]) C {g (Om^(3;i)) , . . . (Om^(xs))} . 

Therefore, the median of the sampled values 

{g{OMz{xi)), g{OMzixs))} n [g{OMz{xi)), g{OMz{y))] 

equals 5(f), if we count each sampled value with multiplicity one. 

By Corollary [9l once we have identified 5(f) we also know ([^ it K^/n\), with high proba- 
bility. 

For odd values of n a similar reasoning shows that Algorithm [8] computes <^([|^] ): Either we 
have OMz{xi) G {§ — 1, §+1} (lines 3-8) in which case the two values g{OMz{y)) and g(OM^(xj)) 
must be two consecutive values in {g{OMz{xj)) \ j G [s]} or we identify as above ^([f ]) as 
the median integer value of [OMz{xi), OMz{y)] (if OMz{xi) < OMz{y)) or [OMz{y), OMz{xi)] (if 
OMz{y) < OMz{xi)), respectively. □ 

Part 3: Calculation of z 

In this section, we prove that the s random samples and the one additional sample needed 
to identify 5([§]) suffice to determine the target string z. We do so by showing that the 
probability that there exists a bit string y ^ z with OMy{xi) = OMz{xi) for all i G [s] with 
OMz{xi) G ± K^/n\ is small. 

^That is, {g{OM4xi)),...,g{OM,{xs))} D {[g{OM,ixi)),g{OM4y))]U [g{OM4y)),g{OM4x,))]) = 
{g{OM4x,)),g{OM4y))}. 
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Algorithm 8: Identifying for odd values of n. 



1 Sample i £ {j £ [s] \ g{OMz{xj)) = m} uniformly at random; 

2 Set y ^ complenient(x) and query g{OMz{y)); 

3 if 5(0M^(y)) < g{OM^{xi)) then 

4 
5 



if Vj G [s] : {g{OM,{xj)) < g{OM,{y))) V {g{OM,{xj)) > g{OM,{x,))) then 

L 



6 else if g{OMz{y)) > g{OMz{xi)) then 

7 if Vj G [s] : ((7(Om^(xj)) > g{OM,{y))) V (5(Om^(xj)) < 5(Om^(x,))) then 

8 \_mg^ g{OMz{y)); 

9 else nig ^ median(g(OM2(y)), g(OM^(xi))|c/(OM^(xi)), . . . , g{OMz{xs))); 
10 output nig 



Lemma 13. Let S := {i £ [s] \ OMz{xi) € zt^-^/n]} 6e i/ie set o/ all samples with OMz-value 
close to ^. Let F := {y \ \/i £ S : OMz{xi) = OMy{xi)} be the set of all y that are consistent 
with the OMz-values for all xi with i £ S. Let X be a constant. 

Then, with probability at least 1 — exp( g^^;^^^ ), we have E[\F\] < 1 + 2"*/^ for 

t:= f(l-2e-2'''). 

In particular we have that, with probability at least 1— cexp(— ^"" ) for some constant 
c, there does not exist a string y G {0, l}"'\{z} such that OMz{xi) = OMy(xj) for all i £ S. 

Proof. First note that, by Lemma [TOl we can assume that |S| > |(1 — 26"^**^) = t, with 

probability at least 1 — exp(— )an y 

Let y £ {0, l}"\{z} and let h := \ y (B z\i be the Hamming distance of y and z. We bound 
the probability that for alH G 5 we have OMy{xi) = OMz{xi). 

If we consider one particular sample x chosen from {0, 1}" uniformly at random, we have 

r , ■. , s : . ^ r ^.1 \OMy(x) = OM^fx)] 

Pr [OUyix) = OMz{x) I OUzix) £ [§ ± ^^]] < ^ ' ' ,n , r^,^ • 

Pr [OMz[x) G ± K^/n\\ 

By Lemma [10] the probability Pr [Om2(x) G ± K-y/n]] that the OM^-value of x lies in the 
interval it Ky/n\ can be bounded from below by 1 — 2e~^'^^. 

Furthermore, OMy{x) = OMz{x) holds if and only if x coincides with z in exactly half of 
the h bits in which z and y differ. Thus, Pi[OMy{x) = OMz{x)] = {1^/2)'^"^ ^ ^® even and 
Fi[OMy{x) = OMz{x)] = for odd values of h. In particular, 

Pr [Om,(x) = Om,(x) \x£S]< yI^^ , 

for all even values h and Pr [OMy(x) = OMz{x) | x G S] = for odd values h. 

Assume h to be even. As the samples drawn independently, the probability 

that OMj^(xj) = OMz{xi) for alH G S* can be bounded as follows. 

/ ( \2^h \ * 

Pr [ l\ (OMyixi) = OMz{x,))] =1[Pt [OUyix,) = Om,(x,)] < ■ 
i&s ies \i ze J 
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As there are (^) different bit strings y with Hamming distance |y©2;|-|^ = h from z, we 
bound the expected number of bit strings y ^ z with OMy{xi) = OM^(xj) for all i £ S from 
above by 




h€.[n];h even 



It has been proven in [5l Proposition 8] that for sufficiently large n and i > 2(H — °iog ^ ) log n ' 
it holds that 



E 0(O2-'')'<("/2)'2- 



3*74 



/i6[n];/i even 



In our case the condition a > 8(l - 2e-2«') ^ > 4(1 _ 2e-2«') ^(l + il^i^) 



logjn 



ensures 



that t satisfies this condition. Furthermore, we have for large enough n that f < 2*/^ and 
lastly, the requirement k > 2 implies that 2e~^''^ < 0.15. Hence, (l — 2e~^'^^) ^ < 0.85~^ < 
1.18 < 2^/'^ and finally, (l - 2e-2'^')~* < 2*/^. We thus conclude that, with probability at least 
1 cxpf (i-^'^"'"')"" ) 



E[|{y / z I Vx G 5 : OMj^(x) = Om^(x)}|] < 2"*/^ . 

□ 

Before we prove Theorem [6l let us briefly remark the following elementary fact about black- 
box complexities. 

Lemma 14 (from [U]). Suppose for an optimization problem P there exists a black-box algo- 
rithm A that, with constant success probability, optimizes P in s iterations. Then the black-box 
complexity of P is at most 0{s). 

We are now ready to prove Theorem [6] for the special case of arity k = n. To this end, we 
fix the values of k := 2 and a := 9 = 8(1 — 2e~'^'^^)~^ 

Proof of Theorem\^for k = n. We need to show that there exists a ranking-based unbiased 
algorithm which optimizes any function Om^ G OneMax„ in an expected number of 0(?i/ log n) 
queries. 

We claim that Algorithm certifies Theorem O for k = n and even values n. For odd values, 
the part in which we identify ^([f]) (lines 5-8 of Algorithm [9]) needs to be replaced by lines 
1-9 of Algorithm [8j We show that the probability that Algorithm [9] queries z after 0(n/ log n) 
iterations is 1 — o(n~^) for all constant values A. By Lemma [HI this implies the desired bound 
for the 7x-ary black-box complexity of OneNIax^. 

First we show that the algorithm employs only unbiased variation operators of arity at most 
n. We have already argued that sampling uniformly at random from {0, 1}" is a 0-ary unbiased 
variation operator and that complement(-) is a unary unbiased one. Therefore, we need to show 
that the operator "sample x £ F uniformly at random" is unbiased and of arity at most n. The 
latter follows from the fact that the size of S is at most s G O(n/logn). The unbiasedness of 
the variation operator follows essentially from the fact that we sample from F uniformly. More 
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Algorithm 9: An n-ary unbiased ranking-based black-box algorithm optimizing 
OneMax„ in 0(n/ log n) queries. 

1 Initialization: 

2 for i = 1, . . . , s do 

3 |_ Sample Xi G {0, 1}" uniformly at random and query g{OMz{xi)); 

4 Identification of g{^): 

5 Sample i £ {j £ [s] \ g{OMz{xj)) = m} uniformly at random; 

6 Set y ^ complement (xj) and query g{OMz{y))] 

7 if g{OMz{y)) = g{OMz{xi)) then rUg ^ m; 

8 else rUg ^ median{g{OMz{y)), g{OMz{xi))\g{OMz{xi)), . . . , g{OMz{xs))); 

9 Compute ^ {z G [s] I OM;,{xi) G [§ ± 2y/n\}; 

10 Compute F ^ {y € {0, 1}" | Vi G 5 : Ou^ixi) = OMy{xi)}] 

11 Sample x £ F uniformly at random and query g{OMz{x)); 




precisely, let us define the following family of |5|-ary distributions over {0,1}'^. Abbreviate 
F{w^,...,w\^\) := {y G {0,1}'^ | G : OMz{wi) = OMy{wi)} and set 

if F{w^, wl^l) / and G F{w'^, w\^\) , 
D{x\w'^, := <j 0, if F{w\ ^1^1) / and ^ F{w^, w^^^) 

otherwise. 

It is now easy to verify that D{- \ , . . . , w^^^) ^|sig|oi}n is a family of unbiased dis- 
tributions: Let y G F{w^ , . . . ,w^^^) and v G {0,1}". For all j G clearly we have 
OMy(^^{w^ (Bv) = OMy{w^) and, consequently, we have y(Bv G F{w^(Bv, . . . ,w^^^(Bv). Similarly 
we conclude that for all permutations a of [n] we have cr(y) G F{a{w^), . . . , a{w^^^)). The same 
reasoning also proves that F{w^, . . . , w^^^) = if and only if F{a{w^ (Bv),..., a{w^^^ © v)) = 0. 

Each run of Algorithm [9] requires an/ Inn + 2 £ 0{n/ log n) queries. The total probability 
of failure is at most the sum of the probabilities that 

• the median of the OM^(xi)-values is not in it y/n\, 

• the probability that there exists a value ^ G =b 2-y/n] with OM^(xj) ^ i for all j G [s], 

• the probability that the size of 5 = {i G [s] \ OM^ixi) G [§±2A/n]} is less than |(l-2e~®), 
and 

• the probability that in line 11 we do not sample z. 

Each of these probabilities is at most o(n~^) for any constant value A G M. By a simple union 
bound we infer that the probability that the target string z is sampled in one run of Algorithm [9] 
is at least 1 — o{n~^). This concludes the proof. □ 

4.3 Proof of Theorem d] for A; e ^{\og^ n) 

The proof for the general case uses a simple idea also exploited in [5]: Given some arity A;, 
In"^ n < k < n, we subdivide the whole bit string into blocks of length k. We show that these 
blocks can be optimized one after the other, each in 0(A;/logA;) steps. As there are [n/A] such 
blocks, the desired 0{n/ logk) runtime bound follows. 
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Algorithm 10: A fc-ary unbiased ranking-based black-box algorithm optimizing 
OneMax„ in 0(n/ log A;) queries. 

1 Initialization: 

2 Sample x £ {0, 1}" uniformly at random and query g(OMz{x)); 

3 Set y ^ complement(x) and query g{OMz{y)); 

4 for j = 1, . . . , [n/fc] do 
5 
6 
7 

8 

9 
10 
11 
12 
13 



Sample x^^'^^ ^ f lipKWhereDif f erent(3;, y) and query g(OMz{x^^'^^)); 
for i = 2, . . . , s do 

Sample x^^''^^ from {v G {0, 1}" | \/£ G I^^^ : Ve = xi} uniformly at random and 
query g{OMz{x^^''^)); 

Identify g{^ + c^^^) ; / /c^-'^ is the contribution of bits in to the OM^-values 
Compute S^^'^ ; / /set of samples with (g o OM^)-value in [| it 2\/li + c^-'^] 
Compute F^^^ ; //set of all feasible bit strings 
Sample z^^^ G F^^^ uniformly at random and query g{OMz{z^-^^)); 
Update y ^ VL-pda.te{y, x, x^^'^\ z^^")) and query g{OMz{y)); 
Update X ^ z^^^; 



Proof of Theorem\^for k G 0(log^ n). To ease reading we assume that k is even. For odd values 
of k, in the following proof all occurrences olk/2 must be replaced by \k/2~\. Further we skip 
the "with probability at least. .."-statements. Instead, we bound the total failure probability at 
the end of this proof. Since k is large, a simple union bound will show that we can assume all 
statements to hold with high enough probability. 

Fix a := 9 = ["8(1 — 2e~^) ] and s := ak/lnk. For a better presentation of the ideas let 
us fix the unknown target function Om^ G OneMax„. 

By Lemma O it suffices to show that, with high probability, Algorithm 1101 queries the target 
string z. Each run of Algorithm 1101 requires 0(ri/ log fc) queries. 

The notation used in Algorithm 1101 is as follows. 

For x,y £ {0, 1}" by 0{x,y) we denote the set {i G [n] \ Xi = yi} of all indices in which 
X and y coincide. As we shall see below, throughout the run of Algorithm [TUl the set 0{x, y) 
equals the set of positions for which we know that Xi = Zi must hold. We call these positions 
optimized. 

The variation operator f lipKWhereDif f erent(-, •) is a binary operator that given two strings 
x,y G {0, 1}*^ picks uniformly at random k' := mm{k,n— \0{x, y)\} different elements ii, . . . 
from the set of positions [n]\0{x,y) in which x and y disagree. It outputs the string x ® e^_^ (B 
. . . ® ef^^. That is, it flips k' positions in x in which x and y differ. This is easily verified to 
be an unbiased operator. As in the proof of the unbiasedness of the operator "sample x € F 
uniformly at random" in Section IT2] this follows essentially from the fact that (i) the k' positions 
are sampled uniformly at random from [n]\0{x,y), that (ii) for all w G {0,1}'^ the equation 
0{x®w, yQw) = 0{x, y) holds, and that (iii) for all cr G S*™ we have 0{a{x),a{y)) = a{0{x, y)). 

Let j G [[re/fc]]. The strings x and x^^'^^ are used to encode which substring ("block") is to 
be optimized in the j-th phase. Namely, throughout the j'-th phase all entries in positions 

:= 0(x,x(^'i)) = {i G [n] \ Xi = x?'^^} 

remain untouched. We only allow the entries in positions R^^'' := [n]\I^^^ to be flipped. We call 
I^^^ the set of irrelevant indices and we call R^^^ the set of relevant indices. Unless we are in 
the very last phase j = In/k] we have |/'--'^| = n — k and \R^^^ \ = k. 
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Similarly one easily verifies that the operator "Sample x^^'^^ from {v G {0, 1}" | \/i S I^^^ : 
V£ = xe} uniformly at random" employed in line 7 of Algorithm [10] is an unbiased one. The 
underlying distribution can be specified as 



D{c I a,b) :-- 




if Xi = Oj for all i G [n] with ai = bi , 
otherwise 



for ah a,6,c G {0,1}". 

Since we do not touch the entries in positions all search points sampled in the j-th 
phase (lines 5~13) have an OM^-value of at least c^^^ := \{i G | Xi = Zi}\. Our algorithm, of 
course, does not know the value of c^^\ Nevertheless we are able to infer + c^^)). This can 
be done as follows. Let x^^''^^ be one of the search points x^^''^\ . . . , x^^'^^ sampled in line 7 such 
that g{OMz{x^^''^^)) equals the median of the multi-set {g{OMz{x^^'^^)), . . . , g{OMz{x^^''^^))} of 
the sampled {g o OM^)-values. 

Let i^-^'*) be the bit string which, on the relevant k bits, is the bitwise complement of x^^'^\ 
Formally, x'f''^ := 1 - xj^''^ for all £ G i?^^) and x'f''^ := xf"^ = x^ for all i G Clearly, x^^'*) 
can be obtained from x^-''*^ x, and x'^^'^^ from the 3-ary unbiased variation operator that, given 
a, b,c,d£ {0, 1}" satisfies 

D{d \a,b,c) = {' 



if y£ G : [be ci ^ de = 1 - ai) A {hi = cg ^ di = ai) 
otherwise. 



To identify ^d+c^^^), in line 8 we query g (Om^(x(J'*))) . Clearly, g{OM,{x^^''^)) = g{^+c^^^) 
if and only if g[OMz{x''^''^^)) = g{OMz{x^^''^^)) . In case g[OMz{x'-^'^^)) / g{OMz{x'-^'^^)) 
we know by Lemma [12] that g{^ + c^-'^) is the median of the sampled values between 
[g[OMz{x^^'^^)),g{OMz{x^^''^^))] or [g[OMz{x^^'^^)) , g[OMz{x^^'^^))], respectively, where each 
sampled {g o OM2)-value is counted with multiplicity one. 

Note that once we have determined g{^ + c^^'^), we can compute the sets 

:= {i G [s] I Om^(x(-''^)) G [| ± 2\/A? + c(^')]} and 

:= {v G {0, 1}" I (ye G /(^'^ : Vi = xg) A (Vi G S^^^ : OMz{x^^''^) = Om^(x(^'^))) } . 

This is due to Lemma [3 where we have shown that for all £ G [| =b there exists at least 
one i such that OUzix'^^'^^) = i + c^-^^. 

We call F^^') the set of all feasible bit strings for the j-th block. In line 11 we sample from 
this set uniformly at random. This is an unbiased operation, as can be shown like we did in 
the proof of Theorem [6] for k = n (see the discussion in that proof just after the definition 
of Algorithm [Uj) . Note that for identification of F^^^ we need at most all samples in S^^^ plus 
the two strings x and x^^'^^ which encode the current block we are optimizing. This is, the 
arity of the corresponding variation operator "sample x G F^^^ uniformly at random" is at most 
l-SI + 2 < 9k/ Ink + 2. Since we assume that k G r2(log^ n), this expression can be bounded by 
k for large enough n. 

For the same reason we can assume that \F^^^\ = 1. This is due to Lemma [13] where we 
have shown that \F^^^\ = 1 with probability at least 1 — 2~^^^~'^'^ \ Note that under this 
assumption, the point z^^'^ sampled in line 11 coincides with z on all relevant bits R^^'^ and it 
coincides with x on all other bits 

In the last step of the j-th phase we need to update x and y. Recall that by 0{x, y) we 
indicate which bits have been optimized already. So we need to update 0{x,y) by adding to it 



22 



R^^\ This can be done in the following way. First we update y by replacing it with 



H . ( OM) (i)^ J^J'^' if^e^^'^ 

update(y, X, x^-^' \z^-") '■= \ 

\yi, otherwise. 

Formally, update(-, •, •, •) is a 4-ary variation operator that, given some a, b,c,d S {0, 1}" returns 
a vector with (update(a, 6, c, d))i = Oj for all i with bi = Ci and (update(a, b, c, = di for all i 
with bi ^ Ci. Clearly, update(fT(o ® w),a{b © w), (t(c© (7(d ® if)) = (T(update(a, 6, c, d) © 
for all bit strings w G {0,1}" and all permutations a € Sn- Therefore, update(-, •, •, •) is an 
unbiased variation operator. 

In line 13 we finally update x by replacing it with z^^\ i.e., we set x ^ z^^\ This concludes 
the j-th phase. Summarizing all the above, we have reduced the Hamming distance from x to 
y by k' . We also have yi = Xi = for all i G R'^^^ and, as we shall see below, with high 
probability, this translates to Xi = Zi for all i £ R^^\ 

The total number of search points queried in the j-th phase is s + 3 G 0(A;/ log A;). Hence, 
the total number of queries made by the algorithm is [n/fe] (s + 3) + 2 G 0(n/ log h). 

To conclude the proof let us bound the total probability of failure. For each block j the 
total probability of failure is at most the sum of the probabilities that 

• the median of the OM2(xi)-values is not in [| it ^/k + c^-^^], 



the probability that there exists a value £ e [| ± 2\/fc + c'^^^] with Om^(x(-''*)) / £ for ah 
i G [s], 

the probability that the size of 5 = G [s] I OM^(xi) G [| ± 2y/k + c(j)]} is less than 
f(l-2e-S), and 

• the probability that in line 11 we do not sample z. 

Each of these probabilities is at most 0(\/A; exp(— i/fc/ log A:)). By the union bound the total 
failure probability is at most 0(n//i;)0(\/A;exp(— \/A;/log A;)), which due to the fact that k G 
O(log^n), is o(l), as desired. □ 

4.4 Proof of Theorem O for k G 0(log^ n) 

In the last part of the proof for Theorem[6l we consider here in this section the case k G 0(log^ n). 
Again we do a block-wise optimization of the target function. However, for such small values 
of k, the union bound does not suffice for a high probability statement. Instead, we need to 
identify ways to ensure that each block- wise optimization yields the desired equality zl = Zi 
for all i G R'^^\ in the notation of the previous section. This can be done by optimizing the first 
length-A: block using the linear query time strategy implicit in Lemma [71 We use this block as 
a reference block. By flipping all bits in the reference block and flipping all bits in the block 
currently under investigation, we can probe whether or not all bits in this block coincide with 
the corresponding entries of the target string. 

Proof of Theorem\^for k G O(log^n). As we have seen in Section 14.11 for constant values of 
k, Theorem [6] is a special case of Lemma [71 Therefore, we can assume in the following that 
k = k{n) grows with n. Further we assume that k is even. For odd values of A;, in the following 
proof all occurrences oi k/2 must be replaced by [A;/2]. 

Fix a := 9 = ["8(1 — 2e~^) ^] and s := ak/lnk. For a better presentation of the ideas let 
us fix the unknown target function Om^ G OneMax,„. 
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First we show that Algorithm [TT] creates in /c + 2 queries two search points x' and y' of 
Hamming distance n — k with the additional property that for all i G [n] with x'- = y'- we know 
x[ = Zi with certainty. 



Algorithm 11: A binary unbiased ranking-based black-box algorithm for optimizing the 
first block of length k. 

1 Initialization: Sample x' G {0, 1}" uniformly at random and query g{OMz{x'))] 

2 Set y' ^ complement(x') and query g{OMz{y')); 

3 Optimization: for t = 1, . . . , /c do 

4 Sample w ^ f liplWhereDif f erent(a;', y') and query g{OMz{w)); 

5 if g{OMz{w)) > g{OMz{x')) then Update x' ^ w; 

6 else Update y' ^ update2(y', x', u;); 



To this end, let us first fix the notation, f liplWhereDif f erent(-, •) is the operator in- 
troduced in Algorithm [TOj It fiips exactly one bit value of the first argument. The position is 
chosen uniformly at random from the set of positions in which the first and the second argument 
disagree. 

update2(-, •, •) is a variation operator, that creates from y',x',w G {0,1}"" the string 
wpda.te2{y' , x' ,w) with (update2(y', x', u'))i = y'^ if x'^ = Wi and (update2(y', x', = x'^ if 
Wi. This is easily verified to be an unbiased variation operator. 

In line 5 of Algorithm [11] either we have g{QMz{vj)) > g{OMz{x')) — in which case the 
position i in which x' and w diff'er satisfies x[ ^ Zi = Wi = y[. In this case, updating x' clearly 
reduces the Hamming distance of x' and y' by one. It preserves the invariant that x'- = zi 
for all positions i G [n] with x'^ = y[. If, alternatively, g{OMz{w)) < g{OMz{x')) holds, then 
x[ = Zi ^ Wi = y[ and we update y' by replacing its i-th bit with x'^. This also reduces the 
Hamming distance of x' and y' by one, again preserving the invariant x^ = y'^^ x'^ = Zi. 

Therefore, after termination of Algorithm 1111 we have two bit strings of Hamming 

distance \x' ® y'\i = k that satisfy (x^ = y^) =^ (x^ = Zj). In what follows, we call the bit 
positions 0{x',y') = {i G [n] \ x'- = y[} the "reference block". Next we show how this block 
allows us to verify that another block of length k is optimized (i.e., that the entries of this block 
coincide with the entries of the target string z). 

The basic idea is simple: first we create from x' and y' two strings x and y such that 
0{x,y) = 0{x',y') but Xi ^ x\ for all i G 0{x',y'). Certainly we have Xj / Zi for all such 
i G 0{x',y'). Starting from x and y, we run the same block-wise optimization routine as in 
Subsection 14.31 where the blocks are chosen from [n]\0{x' ,y'). If we want to test that k specific 
bits of some candidate string z^^^ coincide with the entries of z, all we need to do is to flip 
in z^^^ all k bits of interest as well as all bits in block 0{x',y'). Flipping the bits in block 
0{x',y') increases the OM^-value by k since for all i G 0(x',y'), by construction, zp'* = Xj 7^ Zi. 
Therefore, the OM^-values of the candidate solution z^^^ and its offspring z^^^ coincide if and 
only if z^^^ and z coincide in the k bits of interest. 

The notation in Algorithm [12] is the same as the one used in Algorithm [TO] In addition, we 
make use of the following operators. 

The string initializei(x', y') is defined via 



(initializei(x', y'))i : 




ifiGO(x',y'), 
if i ^ 0{x',y'). 
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Algorithm 12: A k-ary unbiased ranking-based black-box algorithm for k G O(log^n) n 
a;(l) optimizing OneMax„ in 0(n/logA;) queries. 

1 Input: Two bit strings x' and y' with 0{x',y') = k and x'^ = Zi for all i £ 0{x',y'); 

2 Initialization: 

3 Set x ^ initializei(x', y') and query g{OMz{x)); 

4 Set X ^ initialize2(x', y') and query g{OMz{y)); 

5 for j = 1, . . . ,\{n — k)/k] do 

6 repeat 

7 Sample x^^'^^ ^ f lipKWhereDif f erent(2;, y) and query g{OMz{x^-^'^^)); 

8 for i = 2, . . . , s do 

9 Sample x'^-^'*^ from {v G {0, 1}'" | V£ G I^^^ : V£ = xi} uniformly at random and 
_ query 5-(Om^(2;(^'*))) ; 

Identify + c'--'^) ; / /c^^^ is the contribution of bits in I^^^ to the OM^-values 
Compute S^^^ ; //set of samples with {g o OM^)-value in [| it 2\/A- -|- c^-^^] 
Compute F^^^ ; //set of all feasible bit strings 
Sample z^^^ G F^^^ uniformly at random and query g{OMz{z^~'^)); 
Set z^^^ ^ test(z(-^'),x',y',x,x(-''^)) and query g{OMz{z'^^y))] 
until 5(Om,(z(J'))) = 5(0M,(i(j'))); 

Update y ^ n-pda.te{y,x,x^^'^\ z^j^) and query g{OMz{y)); 
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11 

12 
13 
14 
15 
16 
17 



Update X 



18 Set w ^ f inish(x, x', y') and query g{OMz{w)); 



Similarly, we set 



(initialize2(x', := 




ifieO{x',y'), 
\iiiO{x\y'). 



The string initializei(x', y') is obtained through sampling from the distribution D{w \ 
x' , y') = \\iwi = 1 — if and only if z G 0{x' , y'). This is an unbiased distribution. Hence, both 
initializei(-, •) and, by similar reasoning, initialize2(-, •) are unbiased variation operators. 
In line 14 we query test{z^^\x' ,y' ,x,x^^'^^) which is defined via 

, , 0) , , 0-1).. / 1 - if ^ e 0(x', y') or x, + x^^ , 

(test(z^-^^x ,?/ ,x,x^^' 0)i := < . * . 

Iz> , otherwise. 

Again this is easily verified to be sampled from an unbiased (5-ary) distribution. 
Lastly, we define finish(x,x') via 

1-Xj, if i G 0(x',?/') 1 
Xi, if i ^ 0(x',?/') • 

After having optimized all bits in [n]\0(x', y'), this operator finally replaces in x the entries in 
0(x',y') by their complement. Therefore, f inish(x, x', y') equals the target string z. 

Note that Algorithm [T2] queries z in line 18 with certainty. Hence, we only need to argue 
that the expected number of queries of Algorithm 1121 is 0(n/ log fc). We optimize \{n — k)/k~\ 
blocks of length k. Call each execution of lines 7-14 for optimizing a block B a run for B. As 
argued in Section [4.31 for any such block i?, the probability that in line 14 of Algorithm 1121 we 



(f inish(x,x',y'))i 
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have g{OMz{z^^^)) = g{OMz{z^^^)) already after the first run for B is at least 1 — o{k~^) for all 
constant values A. In particular, it is at least constant. This shows that at most a constant 
number of runs are expected for optimizing any block, compare Lemma [TH Each run requires 
s + 2 £ 0{k/ log k) queries. This shows that we need an expected number of 0(A:/ log k) queries 
to optimize any length-Zc block. Since there are \{n — k)/k'] of them, the statement follows by 
linearity of expectation. □ 



5 The Different Black-Box Complexities of BinaryValue 

In the previous section, we have seen that the additional ranking restriction did not increase 
the black-box complexity of the OneMax functions class. In this section, we show an example 
where the two kinds of complexities greatly differ. Another simple class of classical test functions 
does the job, namely the class of generalized binary- value functions. 

The binary-value function Bv is defined via Bv(x) = Y17=i that is, it assigns to each 

bit string the value of the binary number it represents. As before, we regard here generalizations 
of this single function. 

In the following we denote by 5 the Kronecker symbol, i.e., for any two numbers /c,^ G No 
we have 5{k,i) = 1 k = I and 5{k,i) = otherwise. 

Definition 15 (BinaryValue function class). For z G {0,1}" and a G Sn, we define the 
function Bv^,<^ : {0,1}" No,x ^ Bv{a{x © z)) = EILi -^aw)- set Bv^ := 

^^z,id[„] ■ We define the classes 

BinaryValue.„ := {BV2 | z G {0, 1}"} , 
BinaryValue* := {Bv^,^ \ z e {Q, l}",fT G Sn] ■ 

If f £ Binary Value„ (f G BinaryValue*^, there exist exactly one z G {0,1}" (exactly one 
z G {0,1}" and exactly one a G Sn) such that f = Bv^ (f = Bv^^g-j- Since z = argmaxBv^ 
(z = arg max Bv^^o-J; we call z the target string of f. Similarly, we call a the target permutation 
o/Bv^,^. 

We show that the unbiased black-box complexity of the larger class BinaryValue* is 
O(logn), cf. Theorem 1161 whereas the unrestricted ranking-based black-box complexity of the 
smaller class Binary Value„ is r2(n), cf. Theorem 1171 

Let us begin with the upper bound for the unbiased black-box complexity. 

Theorem 16. The *-ary unbiased black-box complexity o/ BinaryValue* (and thus, the one 
o/ Binary Value„J is at most [log2n] +2. 

For every z G {0, 1}" and a £ Sn the function Bv^^o- has 2" different function values. That 
is, the function Bv^^o- : {0, 1}" — >■ [0..2" — 1] is one-to-one. This is in strong contrast to the 
functions Om^ G OneMax„ which obtain values in [0..n] only and are thus far from being 
one-to-one functions. Therefore, from each query to a BinaryValue function we obtain much 
more information about the underlying target string than we would gain from any OneMax 
function. In particular, for each query x and for each i G [n] we can derive from JiVz^ai^) 
whether or not ^^.(j) = 2;^(j). Hence, all we need to do is to identify a. This can be done by 
binary search. 

Proof of Theorem \1(A We show that Algorithm [13] is an unbiased black-box algorithm which 
optimizes every Bv^^o- G BinaryValue* using at most [log2 n] -|- 2 queries. 
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Algorithm 13: An unbiased black-box algorithm optimizing BinaryValue„ in [log2 n] + 
2 queries. 

1 Sample x*-^-* G {0, 1}" uniformly at random and query BYz,<t{x^^'^)', 

2 for k = 2,. . . , [log2 n] + 1 do 

3 |_ Sample x^'^) ^ f lipHalf (x^-*^), . . . , x^'^^^^) and query BVz^a{x^''^); 

4 Set x(ri°g2"l+2) ^ consistent(x(i), . . . , 2;(ri°g2 "l+i)) and query Bv^,<,(x(ri°g2 «l+2)). 



To describe the variation operators used in Algorithm [131 let /c G N and let y^^\ . . . , £ 
{0,1}". Set 

FW(yW,y(2)):= M\FW(y«,y(2)). 

That is, F^^^y^^^y^"^^) contains exactly those bit positions in which y^^^ and y*-^^ disagree and 
F^^^y^^^y^"^^) is the set of positions in which y^^^ and y^^^ coincide. 
Let 1 < ^ < A;. For each {ii, . . . G {0, 1}^ we set 

^(n,...,i„0)(y(l)^ _ _ _ y{i+2)^ p{n,...M)^y{l)^_ . . , y(^+l))\i7(n,...,i.,l)(y(l)^ . . . ^ y(^+2)) _ 

This way we iteratively define F^^^'---''^''\y^^\ . . . ,y^''~^^'^) for all (ii, . . . G {0, l}'^. For any 
such vector (ii, . . . G {0, l}'^ the set . . . , y^^+i)) contains exactly the subset of 

positions from i^(*i'---'*fc-i)(y(i), . . . in which y^^^ and ?/('^+^) agree {ik = 0) and for 1^ = 1 

it contains the subset of positions in . . . ^ y'-'^^) in which y^^^^ and yC^^^) disagree. 

Let 

. . . , y W) := G {0, 1}" | V(ii, . . . , G {0, 1}^-^ : 

. . . = . . . ,yW)|/2j } . 

That is, Z{y^^\ . . . ,y^^^) is the set of bit strings y^^~^^\ which, for every (ii, . . . , par- 
titions the set . . . into two subsets . . . , yC^+i)) and 
^(ii,...,if_i,0)(y(i)^ ^ . . of (almost) equal size. 

For all £ G N, the variation operator f lipHalf (•, ...,•) samples from the £-ary distribution 
. . . ,-))y(i),„,,yWe{o,i}"> which for given y^^\ . . . G {0,1}'" assigns to each y G {0,1}" 
the probability 

1 0, otherwise. 

This is an unbiased distribution: For all y,w,y^^\...,y^^^ G {0,1}" and all (zi, . . . , G 
{0, lY~^ we have 

From this we easily obtain that y G Z{y^^\ . . . , y^^^) if and only if y©u) G Z{y^^^(Bw, . . . , y^^^(Bw). 
In addition, for all G Sn we have 
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and thus, y € Z{y^^\ . . . ,y^^^) holds if and only if 9{y) G Z{9{y^^^), . . . ,9{y^^^)). From this and 
the fact that each bit string in Z{y^^\ . . . ,y^^^) is assigned the same probability we infer that 
f lipHalf is an unbiased variation operator. This is true for all i. 

For the description of the second variation operator we abbreviate t := [log2 n] + 1 and we 
assume that BVz,it G BinaryValue* is the (unknown) function to be optimized. 

For ah G {0,1}" let 

_^consistont(y(i)^ _ . . , y^^^) := {z' E {0, 1}" | 3a' G 5„Vi G [t] : Bv,, = Bv,,,(2/(^))} , 

the set of all bit strings that are consistent with the queries y^^\...,y^^\ This is the set 
of all possible target strings. As we shall see below, in line 4 of Algorithm [13] we have 

Inconsistent ^^(1) a;^*-')] = 1 

Abbreviate jrconsistent^yCl)^ _ _ _^y{t)^ ^ jrconsistent ^ y ^ l}n 

i^'(y|y«,...,yW):= 

This is the distribution from which the variation operator consistent(y(^), . . . , y^*)) sam- 
ples. It is an unbiased t-ary distribution as can be easily verified using the fact that 

y G jrconsistent(^y(l)^ _ _ _ ^y(t)) -f ^^^^ ^^i^ if y q ^ g jrconsistent (^(1) , , , ^ y(i) ^) ^nd if 

and only if d{y) G jr™nsistent (51(^(1)), . . . ^ 6l(?/(*))) for every 6 G 5^. 

In what follows we argue that in line 4 of Algorithm [T3] there exists exactly one string 
z' G j^co°sistent^^(i)^ ^ ^ ^ jX*^*)). In this case, clearly, z' = z must hold. 

Let us first show that from x^^\...,x^^^ we can infer the underlying target permuta- 
tion a. We do so by proving that (a) for any 2 < k < t and for all j G [n] we can deter- 
mine the index (ii, . . . , G {0,1}^^-*^ with a{j) G F(*1'---'*''-i)(x(^), . . . , a;^^)). This suffices 
to determine a because, by construction, for all vectors {ii, . . . ,it^i) G {0,1}*"^, we have 

< 1. 

The key argument proving statement (a) is the injectivity of Bv^^o-i which, for any index 
j G [n] and for every search point x G {0, 1}", reveals whether or not 2;cr(j) = ^a{j)- Let us fix 
an index j G [n]. Clearly, a{j) G F^^\x^^\x^'^')) if and only if 

(1) A (2) ^ 

• Mj) = Mi) and x'^^.^ / or 

(1) -I A (2) 

Similarly, if a{j) G F^'^ x*^'^)) is known, then a{j) G 

if and only if 

(fc) , (fc+i) , 

• Mj) = M3) and / or 

(fc) , , (fc+i) 

• ^aU) ^ Ml) and = ^,(,). 

Hence, in hue 4 of Algorithm [13] we know a. Let z' G j:"consistent|'^(i)^ _ _ _ ^ x*^*^). By definition, 

there exists a permutation a' G 5„ such that for all i G [t] we have Bv^/.ct'(x(*^) = Bv2^o-(x(*^). 

Let j G [n]. As we have shown above, we can identify the vector (ii, . . . , such that 

a{j) G F(*1'-'*'-i)(x(^), . . . By construction, also a'{j) G . . . , x^) must 

hold. This shows a' = cr. Hence, Bv^/^o-'(^^"^^) = z' ,ct{x^^'') = Bv^^o-(2^'-^'')- But as this requires 
z' = z, we conclude that indeed |jrconsistent(^^(i)^ ^ ^ ^ = 1. 



I ^consistent | — 1 y (z ^consistent 

0, if JC-consistent ^ and y ^ jrconsistent 

2~", otherwise. 
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Putting everything together we have shown that from the first t = [log2 n] + 1 samples we 
can infer both the target permutation a as well as the target string z. This can be sampled 
from an unbiased distribution in the ([log2 n\ + 2)nd query. □ 

Let us now prove that already the unrestricted ranking-based black-box complexity of 
BinaryValue^ is asymptotically different from the basic unbiased one of Binary Value* . 

Theorem 17. The unrestricted ranking-based black-box complexity of BinaryValue„ and 
Bin ARY Value* is larger than n — 1. 

As discussed in the introduction, Droste, Jansen, and Wegener [10] implicitly showed 
a lower bound of il(n/logn) for the unrestricted ranking-based black-box complexity of 
BinaryValue„. Our lower bound of n — 1 is almost tight. An upper bounds of n -|- 1 for 
the unrestricted ranking-based black-box complexity of BinaryValue* can be shown exactly 
in the same way as in |10^ Theorem 5]. Intuitively, the algorithm which starts with a random 
initial search point and then, from left to right, flips in each iteration exactly one bit shows this 
bound. This is a deterministic version of Algorithm [6l 

For the unbiased black-box complexities of BinaryValue.„ and BinaryValue* the sit- 
uation is as follows. Both the basic as well as the ranking-based unary unbiased black-box 
complexity of BINARY Value„ are of order 0(nlogn). The lower bound follows from the al- 
ready mentioned theorem in \n\ Theorem 6], which implies that any function with a single 
global optimum has an unary unbiased black-box complexity of Q{nlogn). The upper bound 
follows from the fact that, for example, Random Local Search (Algorithm [5]) solves any instance 
Bv^^o- € BinaryValue* in an expected number of 0(n log n) queries. The latter follows from 
the coupon-collector's problem. 

For higher arities k > 2 Theorem [T7] and Lemma [7] from Section 14.11 immediately yield the 
following. 

Corollary 18. For all k > 2, the k-ary unbiased ranking-based black-box complexity of 
BinaryValue„ and BinaryValue* is larger than n — 1 and it is at most 4n — 5. 

To derive the lower bound, Theorem 1171 we employ Yao's minimax principle |24j . 

Theorem 19 (Yao's minimax principle, formulation following [2Uj). Let U be a problem with 
a finite set I of input instances (of a fixed size) permitting a finite set A of deterministic 
algorithms. Let p be a probability distribution over I and q be a probability distribution over A. 
Then, 

minE[r(/p, A)] < maxE[T(/, Ag)] , 

where Lp denotes a random input chosen from I according to p, Ag a random algorithm chosen 
from A according to q, and T{L, A) denotes the runtime of algorithm A on input L . 

We apply Yao's minimax principle in our setting as follows. We show that in the ranking- 
based black-box model any deterministic algorithm needs an expected number of more than 
n — 1 iterations to optimize Bv^, if Bv^ is taken from Binary Value„ uniformly at random. 
Theorem 1191 then implies that for any randomized algorithm A there exist at least one instance 
Bv^ G BinaryValue.„ such that it takes, in expectation, at least n — 1 iterations for algorithm 
A to optimize Bv^. This implies Theorem 1171 

The crucial observation is that when optimizing Bv^ with a ranking-based algorithms, then 
from t samples we can learn at most t—1 bits of the hidden bit string z. This is easy to see for two 
samples x, y. If Bv2(x) > Y^Wziu)-, we see that = 7^ y^, where k := max{j S [n] | Xj / 
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but we cannot infer any information about X£ for £ ^ k. Similarly, if we have t samples 
x^^\ . . . , and their corresponding Bv^-values, we cannot infer (2) bits of information as one 
might guess, but at most t — 1 bits. As we shall see, this is an immediate consequence of the 
following combinatorial lemma. 

Lemma 20. Let t G [n] and let x^^\ . . . be t pairwise different bit strings. For every pair 
{i,j) G [t]'^ we set £ij := max{A; G [n] \ x^*^ 7^ ^k^}) ^-^-j is the largest bit position in which 
and x^^) differ. Then \i,j e [t]}\ <t-l. 

Proof. Let z G {0, 1}". By renaming the bit strings where required, we may assume 'BVz{x^^'^) > 
... > Bv^(a;W). 

We prove the statement by induction on t. For t = 1 and t = 2 there is nothing to show. 
Therefore, we may assume that we have proven \{£ij \ i,j G [A;]}| < k — 1 for some k > 2. Now, 
if ih,k+i S I i,j G [k]} for all h G [k], then clearly we have \{£ij \ i,j G [/c + 1]}| < k — 1. 
Thus, we may assume without loss of generality that there exists a, h £ [k] with ih.k+i ^ I 
i,j G [k]}. 

Since Bv^fx^'^)) > Bv,(x(^+^)), the definition of Bv^ implies that , , , = xi'^'' ^ x^^^^^ . 

Z\ J Z\ J, Z f th,k + l lh.k + 1 ' (-h.k + 1 

Now, let j G [k]. We show that either £j,k+i = ^h,k+i or £j.k+i = (^j,h- 

Let us first consider the case j < h. Since BVzix^^^) > BVzix^'^'^) it holds by the definition 
of Bv^ that Zi-, = x^P 7^ xf^ . Now, either l^h > ^/i fc+i or ^ih < ^hk+i- In the first case 

(k-\-l) (h) (j) 

xy ^ ^ = ^ / h'' ^*^*' "^^'^+1 — ^i'^' other hand, by definition of the li^^^ for ah 

I > tj^h we have x'p = xf^\ For the same reason, due to the fact ^ > > ih,k-\-i-, we also 
have for all such i that = xf^~^^\ From this we infer ^j^k+i ^ ^j,h- This shows tj^k+i = ^j,h- 
Equivalently, if ij^h < 4,fc+i, then xy^'^_^^ = x}J^^_^ / x}^ Thus, £j^k+i > 4,fc+i- On the 

(k~\-l) (h) (j) 

other hand we have for all I > ih,k+i > ^j,h that x^ = x^ = x^ . This shows ij,k+i < ^h.k+i 
and we conclude (-j,k+i = ^h,k+i- 

The reasoning for j > h is the same. □ 

We are now ready to prove Theorem [T71 

Proof of Theorem\17\ Since the search space {0,1}'^ is finite, the set A of all deterministic 
algorithms on BinaryValue^ is finite, if we restrict our attention to those algorithms which 
stop querying search points after the n-th iteration. 

As mentioned above, we equip BinaryValue„ with the uniform distribution. Let Bv^ G 
BinaryValue„ be drawn uniformly at random and let j4 G ^ be a (deterministic) algorithm. 
We show the following statement below. 

(A) Prior to the t-th iteration, the set of still possible target strings has size at least 2""*"*"^ 
and that all of these target strings have the same probability to be the desired target 
string. 

Consequently, the probability to query the correct bit string in the i-th iteration, given that 
the algorithm has not found it in a previous iteration, is at most 2""'"'"*"^. This shows that the 
expected number of iterations E[r(BV2,yl)] until algorithm A queries the target string z can 
be bounded from below by 

n n n 

queries z m the i-th iteration] > ^ i • 2""+*"^ = ^ (n - i + 1) 2"* 

i=l i=l 

n n 

(n + l)j;2--^z2-\ (5) 

i=l i=l 



i=l i=l i=l 

n n 
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A simple, but nonetheless very helpful observation shows 

n n n 

i=l i=l i=l 

n-1 

= (l-2-'^) + 2-i J]]z2-S 

i=l 

yielding Y.^^^ i 2"* = 2(1 - 2"") - n2-" = 2 - (n + 2)2"" . 
Plugging this into ([5]), we obtain 

E[r(Bv^, ^)] > (n + 1)(1 - 2"") - (2 - (n + 2)2"") >n-l. 

This proves min^g_4 E[T(Bv2, A)] > n — 1 for Bv^ taken from BinaryValue„ uniformly at 
random. Yao's minimax principle implies that for any distribution q over the set of deterministic 
algorithms we have max^gjo,!}™ E[T(BV2, Ag)] > minyig_4 E[r(Bv^, A)] > n — 1. That is, the 
ranking-based black-box complexity of Binary Value„ is larger than n — 1. 

It remains to prove (A). Let t < n and let x^^\ . . . ,x^^^ be the search points which have 
been queried by the algorithm in the first t iterations. All the algorithm has learned about 
x^^\ . . . is the ranking of these bit strings induced by Bv^, i.e., it knows for all i,j S [t] 
whether Bv^(xW) > Bv^(x(j)), or Bv^{x^'^) < Bv^(x(j)), or Bv^(a;W) = Bv^(x(j)). Note 
that Bv^(x'-*^) = BVz{x^-'^^ implies x*-*^ = x^^\ Thus, this case can be disregarded as one 
cannot learn any additional information by querying the same bit string twice. 

As in Lemma [20l we set iij := max{/c G [n] \ x^*^ 7^ ^k^} ^'i ^ W ^^^^ we set 

C:={i,,j\i,j e[t]}. 

Let i ^ C and let i,j G [t] such that max{A; € [n] | x^*^ / ^k^} = ^- We can fix = x^*^ if 
Bv^xW) > Bv,{x^^^), and we fix = x'-p if Bv^x^) < Bv^x^^^). That is, we can fix |£| 
bits of z. 

Statement (A) follows from observing that for every bit string z' with = ze for all ^ G £ 
the function Bv^' yields exactly the same ranking as BVj;. Hence, all such z' are possible target 
strings. Since there is no way to differentiate between them, all of them are equally likely to be 
the desired target string. 

Furthermore, it holds by Lemma [2U] that \C\ < t — 1. This shows that, at the end of the t-th 
iteration, there are at least 2"~(*~^) possible target strings. By definition of C, the algorithm 
has queried at most one of them. Consequently, prior to executing the (t + l)-st iteration, there 
are at most 2""*^*"^-' — 1 > 2"~* bit strings which are equally likely to be the desired target 
string. This proves (A). □ 

Note that already a much simpler proof, also applying Yao's minimax principle, shows the 
following general lower bound. 

Theorem 21. Let T he a class of functions such that each f ^ J- has a unique global optimum 
and such that for all z G {0, 1}" there exists a function fz^^ with z = argmax/^. Then the 
unrestricted ranking-based black-box complexity of J- is il(n/logn). 

6 Conclusions 

Motivated by the fact that (i) previous complexity models for randomized search heuristics 
give unrealistic low complexities and (ii) that many randomized search heuristics only compare 
objective values, but not regard their absolute values, we added such a restriction to the two 



n 

= (1 - 2-") + 2-1 - 1) 2-(^"i) 

i=l 
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existing black-box models. While this does not change the black-box complexity of the OneMax 
function class (this remains relatively low at 0(n/logn)), we do gain an advantage for the 
BinaryValue function class. Here the complexity is O(logn) without the ranking restriction, 
but Q{n) in the ranking-based model. Our results thus show that for many (but not all) 
optimization problems, adding the ranking-basedness condition yields more realistic difficulty 
estimates than the previous black-box models. 
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